All Questions
Tagged with pytorch tensorflow
74 questions
0 votes
0 answers
17 views
Custom loss function not behaving as expected in PyTorch but does in TensorFlow
I tried modifying the reconstruction loss such that values that are pushed out of bounds do not contribute to the loss and it works as expected in tensorflow after training an autoencoder. However, ...
0 votes
2 answers
42 views
Embeddings for multiple categorical features with different cardinality
If I have multiple categorical features, each of which has its own cardinality, and I want to use an embedding layer to reduce the dimensions fed to an MLP, should I have one big embedding matrix ...
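One way to picture the "one big embedding matrix" option is the offset trick sketched below: each feature gets its own block of rows in a shared table, and per-feature category ids are shifted into that block. This is only an illustration in plain Python; the cardinalities, the sample row, and the helper names `feature_offsets` and `to_shared_indices` are hypothetical and not taken from the question.

```python
from itertools import accumulate


def feature_offsets(cardinalities):
    """Return the first row of each feature's block in a shared table."""
    # Exclusive prefix sum of the cardinalities: [0, c0, c0 + c1, ...]
    return [0] + list(accumulate(cardinalities))[:-1]


def to_shared_indices(row, offsets):
    """Shift each per-feature category id into its feature's block."""
    return [idx + off for idx, off in zip(row, offsets)]


# Hypothetical cardinalities for three categorical features.
cardinalities = [3, 10, 4]
offsets = feature_offsets(cardinalities)
total_rows = sum(cardinalities)  # rows the shared table would need

sample = [2, 7, 0]  # one category id per feature
shared = to_shared_indices(sample, offsets)
print(offsets, total_rows, shared)
```

A shared table forces every feature to use the same embedding dimension, whereas separate per-feature tables allow a different dimension for each cardinality.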
0 votes
0 answers
14 views
Are group convolutions still used?
Group convolutions theoretically should reduce the number of parameters and hence improve the speed of inference, without significantly affecting the performance of the model. However, I don't notice ...
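The parameter reduction mentioned here can be checked with simple arithmetic. The sketch below counts the weights of a 2D convolution as `out_channels * (in_channels / groups) * k * k`, plus one bias per output channel; the layer sizes are hypothetical, and the count says nothing about measured inference speed.

```python
def conv2d_params(in_channels, out_channels, kernel_size, groups=1, bias=True):
    """Count the learnable parameters of a square-kernel 2D convolution."""
    if in_channels % groups or out_channels % groups:
        raise ValueError("channels must be divisible by groups")
    # Each output channel only sees in_channels // groups input channels.
    weights = out_channels * (in_channels // groups) * kernel_size ** 2
    return weights + (out_channels if bias else 0)


# Hypothetical layer: 64 -> 128 channels with a 3x3 kernel.
dense = conv2d_params(64, 128, 3, groups=1)
grouped = conv2d_params(64, 128, 3, groups=4)
depthwise_like = conv2d_params(64, 128, 3, groups=64)
print(dense, grouped, depthwise_like)
```

With 4 groups the weight count drops by a factor of 4, while the bias term stays the same.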
0 votes
0 answers
23 views
tensorflow to pytorch weight transfer
There is a modified EfficientNet TF model that I'm trying to reproduce in PyTorch. I have made the architecture changes to the model in PyTorch, dumped the TF model weights, and loaded them back in ...
0 votes
0 answers
47 views
How to Prepare Data for U-Net Model Training with .tif Images
I'm new to image segmentation and trying to train a U-Net model. I have a dataset consisting of .tif satellite images and their corresponding annotations. Here is a sample of my data: ...
0 votes
1 answer
34 views
Where should I learn pytorch from?
I'm an undergraduate student. I've coded a three-node neural network (that works) based on my professor's guidance. However, I'd like to pursue a career in AI and Data Science, and I'd like to teach ...
0 votes
0 answers
11 views
Vocal Tract Length Perturbation (VTLP) code
I'm looking for an example of code that uses the VTLP (Vocal Tract Length Perturbation) technique to improve speech recognition. Regards
1 vote
1 answer
60 views
Weird neural network approach
I'm working on a problem where I need to create a neural network to optimize the seating arrangement for 24 unique individuals in a 6x4 grid, minimizing conflicts between adjacent (up,down,left,right) ...
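Whatever optimizer ends up being used, the objective described here can be written down directly: sum a conflict score over every pair of seats that are adjacent up, down, left, or right. The sketch below is one possible formulation; the conflict matrix is a made-up placeholder, and `adjacent_pairs` and `total_conflict` are hypothetical helper names.

```python
def adjacent_pairs(rows, cols):
    """List each pair of 4-neighbour seats once, as ((r1, c1), (r2, c2))."""
    pairs = []
    for r in range(rows):
        for c in range(cols):
            if c + 1 < cols:  # neighbour to the right
                pairs.append(((r, c), (r, c + 1)))
            if r + 1 < rows:  # neighbour below
                pairs.append(((r, c), (r + 1, c)))
    return pairs


def total_conflict(grid, conflict):
    """Sum conflict[a][b] over all adjacent people a, b in the grid."""
    rows, cols = len(grid), len(grid[0])
    return sum(
        conflict[grid[r1][c1]][grid[r2][c2]]
        for (r1, c1), (r2, c2) in adjacent_pairs(rows, cols)
    )


ROWS, COLS = 6, 4
n_people = ROWS * COLS  # 24 individuals

# Placeholder conflict matrix: people with consecutive ids clash.
conflict = [[1 if abs(i - j) == 1 else 0 for j in range(n_people)]
            for i in range(n_people)]

# Seat people 0..23 in row-major order and score that arrangement.
grid = [[r * COLS + c for c in range(COLS)] for r in range(ROWS)]
print(len(adjacent_pairs(ROWS, COLS)), total_conflict(grid, conflict))
```

A 6x4 grid has 38 adjacent pairs (18 horizontal, 20 vertical), so any candidate arrangement can be scored with 38 lookups.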
0 votes
1 answer
34 views
How to get the closest samples to time series dataset?
I have a deep learning time series classification model. I want to understand whether the model failed to classify due to missing or incorrect training inputs. For simplicity let's say we have a training ...
1 vote
1 answer
165 views
How to add a new item in the embeddings vocabulary?
Imagine you have trained a model containing an Embedding layer. Your model performs well and you're happy with your embedding. Then, suddenly, you want to add a new item in your vocabulary. In other ...
0 votes
0 answers
344 views
How to use pretrained encoder for customized Unet
If you have a standard U-Net encoder such as ResNet-50, then it's easy to add pretraining to it. For example: ...
0 votes
1 answer
536 views
Effect of hyperparameters (hidden size, layers, MLP size, number of heads) on Transformer
Is there any paper that explains the effect of these hyperparameters on Transformer performance: hidden size, number of layers, MLP size, and number of heads? I found some explanation on the web but I need ...
0 votes
1 answer
105 views
Query about Sigmoid activation function calculation
While applying the sigmoid activation function (to find the y label), I have calculated it as below: y = 0.35 + (0.8 * 0.1) + (0.3 * 0.6) + (-0.2 * 0.4) = 0.53, sigmoid_y = 0.625. How do we take threshold ...
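The arithmetic in this excerpt can be reproduced in a few lines. In the sketch below, treating 0.35 as the bias and splitting each product into a weight and an input is an assumption (the result is the same either way), and the 0.5 decision threshold is a common default rather than something stated in the question.

```python
import math


def sigmoid(z):
    """Logistic function 1 / (1 + exp(-z))."""
    return 1.0 / (1.0 + math.exp(-z))


bias = 0.35
weights = [0.8, 0.3, -0.2]  # assumed to be the weights
inputs = [0.1, 0.6, 0.4]    # assumed to be the inputs

y = bias + sum(w * x for w, x in zip(weights, inputs))
prob = sigmoid(y)

threshold = 0.5  # assumed default cut-off for a binary label
label = int(prob >= threshold)
print(round(y, 2), round(prob, 3), label)
```

The pre-activation comes out at 0.53 as in the excerpt, while the sigmoid evaluates to roughly 0.629, slightly above the 0.625 quoted; with a 0.5 threshold the predicted label would be 1.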
4 votes
1 answer
868 views
Should a Learning Rate Scheduler adjust the learning rate by optimization step (batch) or by epoch?
The PyTorch docs say that torch.optim.lr_scheduler provides several methods to adjust the learning rate based on the number of epochs. However, from other ...
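The difference between the two conventions comes down to which counter drives the schedule. The sketch below is a plain-Python stand-in for a step-decay schedule, not the torch.optim.lr_scheduler API; `step_decay` and all the numbers are hypothetical. In PyTorch the unit is set by where `scheduler.step()` is called, since the scheduler advances once per call.

```python
def step_decay(base_lr, gamma, step_size, count):
    """Multiply base_lr by gamma once every step_size counts."""
    return base_lr * gamma ** (count // step_size)


base_lr, gamma = 0.1, 0.5
epochs, batches_per_epoch = 4, 3

per_epoch, per_batch, matched = [], [], []
global_step = 0
for epoch in range(epochs):
    for _ in range(batches_per_epoch):
        # Counter = epoch index: decay every 2 epochs.
        per_epoch.append(step_decay(base_lr, gamma, 2, epoch))
        # Counter = batch index with the same step_size: decays far faster.
        per_batch.append(step_decay(base_lr, gamma, 2, global_step))
        # Counter = batch index, step_size rescaled by batches per epoch.
        matched.append(step_decay(base_lr, gamma, 2 * batches_per_epoch, global_step))
        global_step += 1

print(per_epoch[-1], per_batch[-1], matched == per_epoch)
```

Moving a schedule from per-epoch to per-batch stepping without rescaling its step size changes how fast the learning rate decays, which is why the two conventions should not be mixed for the same hyperparameters.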
1 vote
1 answer
744 views
How to handle OOV in non-contextual embeddings (word2vec, GloVe, FastText)?
How do non-contextual embeddings (Word2Vec, GloVe, FastText) handle OOV words (i.e. when a given word is not in the vocabulary)?
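For context, Word2Vec and GloVe store one vector per vocabulary word, so an unseen word has no entry and needs a fallback such as an unknown-token or zero vector. FastText additionally learns vectors for character n-grams and can compose a vector for an unseen word from them. The sketch below illustrates that idea in plain Python; the tiny n-gram table and the `lookup` helper are hypothetical, and real FastText hashes n-grams into buckets rather than using a dictionary.

```python
def char_ngrams(word, n_min=3, n_max=6):
    """Character n-grams of the word wrapped in '<' and '>' boundary markers."""
    token = f"<{word}>"
    return [token[i:i + n]
            for n in range(n_min, n_max + 1)
            for i in range(len(token) - n + 1)]


def lookup(word, word_vectors, ngram_vectors, dim):
    """Return a vector for word, composing from n-grams when it is OOV."""
    if word in word_vectors:
        return word_vectors[word]
    known = [ngram_vectors[g] for g in char_ngrams(word) if g in ngram_vectors]
    if not known:
        return [0.0] * dim  # fallback, similar to a zero/unknown vector
    # Average the vectors of the n-grams that were seen in training.
    return [sum(v[k] for v in known) / len(known) for k in range(dim)]


# Hypothetical toy tables with 2-dimensional vectors.
word_vectors = {}
ngram_vectors = {"<wh": [1.0, 0.0], "re>": [0.0, 1.0]}

print(char_ngrams("where", 3, 3))
print(lookup("where", word_vectors, ngram_vectors, 2))
print(lookup("xyz", word_vectors, ngram_vectors, 2))
```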